Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add a mechanism for converting SQL schemas based on negotiated version #2417

Merged
merged 9 commits into from
Jan 16, 2025

Conversation

jackkleeman
Copy link
Contributor

@jackkleeman jackkleeman commented Dec 12, 2024

Currently there are 2 ways that old CLIs will break against 1.2

  1. COUNT(*) queries no longer return nullable values. The old CLI expects that they do for a particular field.
  2. We no longer convert LargeUtf8 and LargeBinary to their non large equivalents. The old CLI expects that we do.

And i want to introduce a new way, by encoding our timestamp data correctly instead of as date64.

To cover these 3 breaking changes, I have added a mechanism to convert schemas on the fly based on negotiated version. CLI clients will start providing their negotiated version as a header x-restate-admin-api-version. CLI user-agent clients that don't do so will be presumed to be on version 1. Other clients that don't do so will be presumed to be happy with the latest version. This is still a slight breaking change if someone is depending on certain schema types, but they can provide x-restate-admin-api-version: 1 to maintain the current behaviour.

NB: this PR does not cover newer CLIs talking to older restate clusters. In my opinion we should move the CLI to use JSON for the sql operations, which is itself a somewhat breaking change as json is only present from 1.1.3, but once thats sorted, these conversions become mostly irrelevant for new CLIs.

Copy link
Contributor

@tillrohrmann tillrohrmann left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks a lot for repairing support for older clients @jackkleeman. The changes look good to me. The two relevant questions I had were whether we settled on encoding the version via header values and whether no header value should mean the latest version. For the latter, I can see that is very convenient when starting (because you don't have to think about versions) but once you upgrade, you might be bitten by it. Is this a common practice that people would be aware of?

Regarding whether the CLI 1.2 should be able to talk to a server 1.1: It would certainly be nice but I am unsure about the added complexity at this point. Having a meaningful error message telling users what to do might be good enough. I guess where it becomes inconvenient is when one is interacting with different Restate server versions (e.g. local dev is 1.2 and production is 1.1).

cli/src/clients/datafusion_helpers.rs Show resolved Hide resolved
cli/src/clients/datafusion_helpers.rs Outdated Show resolved Hide resolved
Comment on lines 32 to 33
AdminApiVersion::HEADER_NAME,
http::HeaderValue::from(value.as_repr()),
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Did we come to a conclusion how to encode the different versions (header vs. path)?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

we have not

Comment on lines 49 to 39
match headers.get(Self::HEADER_NAME) {
Some(value) => match value.to_str() {
Ok(value) => match value.parse::<u16>() {
Ok(value) => match Self::try_from(value) {
Ok(value) => value,
Err(_) => Self::Unknown,
},
Err(_) => Self::Unknown,
},
Err(_) => Self::Unknown,
},
// CLI didn't used to send the version header, but if we know its the CLI, then we can treat that as V1
None if is_cli => Self::V1,
None => Self::Unknown,
}
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Personally, I am not a huge fan of this kind of special casing because it can be confusing.

Is it standard practice to consider a request w/o a version header as the latest? While this is very convenient, it seems also like the best way to run into breaking changes because initially users don't have to think about versions until they upgrade.

Would it be an option to always consider a missing version header as v1?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

i think it is not standard practice. i think if its a header, we need to make sure that the naive curl case without the header does something sensible. if its a url path, we could perhaps deprecate the unversioned one, but given that we are versioning the whole api, it could mean fairly regularly breaking old scripts/clients which use a versioned url, when we remove support for an old api version. i don't really have a great answer here!

Copy link
Contributor

@tillrohrmann tillrohrmann Dec 16, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yeah, the versioning approach via headers makes it a bit more hidden and I agree that the basic curl case should still work. At the same time, I would like to avoid a very obvious foot gun if possible.

Regarding deprecating a versioned url, I wouldn't worry too much about it. This is something that we can announce and then slowly phase out to give users time to adjust. This won't be so easily possible for the API that always serves the latest version (unless it is announced in advance).

crates/admin/src/storage_query/convert.rs Outdated Show resolved Hide resolved
crates/admin/src/storage_query/convert.rs Outdated Show resolved Hide resolved
crates/admin/src/storage_query/convert.rs Show resolved Hide resolved
crates/admin/src/storage_query/convert.rs Outdated Show resolved Hide resolved
Copy link
Contributor

@igalshilman igalshilman left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The changes look good to me.

*old cli's (identified via a user agent) --> precisely identified (not a heuristic)

  • no version specified ==> means latest.
  • the server and the cli negotiate a version.
  • curl at al, always use latest, unless specified otherwise
  • overtime we drop support for v1 and then we won't need to converters.

@jackkleeman jackkleeman force-pushed the sql-schema-converters branch from ec47375 to ca9abd8 Compare January 16, 2025 14:23
converter for the missing ty field in handler metadata
@jackkleeman jackkleeman force-pushed the sql-schema-converters branch from ca9abd8 to 8109f7a Compare January 16, 2025 14:37
@jackkleeman jackkleeman merged commit b24db8b into restatedev:main Jan 16, 2025
14 checks passed
@jackkleeman jackkleeman deleted the sql-schema-converters branch January 16, 2025 16:27
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants